Generalization of Force Control Policies from Demonstrations for Constrained Robotic Motion Tasks - A Regression-Based Approach
Authors
V. Koropouli, Institute of Automatic Control Engineering, Technische Universität München, Karlstr. 45, 80333 Munich, Germany. Tel.: +49-89-28926885
S. Hirche, Institute for Information-Oriented Control, Technische Universität München, Barer Str. 21, 80333 Munich, Germany
D. Lee, Institute of Automatic Control Engineering, Technische Universität München, Karlstr. 45, 80333 Munich, Germany
Abstract
Although learning control policies from demonstrations has been thoroughly investigated in the literature, generalizing policies to new contexts remains a challenge, since existing approaches exhibit limited performance when generalizing to new tasks. In this article, we propose two approaches for generalizing motion-based force control policies, with the aim of performing constrained motions in the presence of motion-dependent external forces. The key concept of the proposed methods is to use, apart from policy values, also policy derivatives or differences, which express how the policy varies with respect to variations in its input, and to combine these two kinds of information to generalize the policy to new inputs. The first approach learns policy and policy derivative values by linear regression and combines them into a first-order Taylor-like polynomial to estimate the policy at new inputs. The second approach learns policy and policy difference data by locally weighted regression and combines them in a superposition fashion to estimate the policy at new inputs. The policy differences in this approach represent variations of the policy in the direction that minimizes the distance between the new incoming input and the average demonstrated input. The proposed approaches are evaluated in real-world robot constrained motion tasks using a linear-actuated, two degrees-of-freedom haptic device.
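The first-order Taylor-like estimate mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the scalar input, the choice of the mean demonstrated input as expansion point, the function names, and the demonstration data are all assumptions made for illustration. A linear least-squares fit yields a policy value `a` and a policy derivative `b` at the expansion point `x0`, and the policy at a new input is then estimated as `a + b * (x_new - x0)`.

```python
import numpy as np

def fit_value_and_derivative(x, u, x0):
    """Least-squares fit of u ~ a + b * (x - x0).

    Returns (a, b): the estimated policy value and policy derivative at x0.
    Hypothetical helper, not taken from the paper.
    """
    x = np.asarray(x, dtype=float)
    u = np.asarray(u, dtype=float)
    # Design matrix with a constant column and the offset from x0.
    A = np.column_stack([np.ones_like(x), x - x0])
    (a, b), *_ = np.linalg.lstsq(A, u, rcond=None)
    return a, b

def taylor_estimate(a, b, x0, x_new):
    """First-order Taylor-like estimate of the policy at x_new."""
    return a + b * (x_new - x0)

# Assumed demonstration data: force command u as a function of a motion input x.
x_demo = np.array([0.0, 0.1, 0.2, 0.3, 0.4])
u_demo = 2.0 + 5.0 * x_demo

x0 = x_demo.mean()                      # expansion point (an assumption)
a, b = fit_value_and_derivative(x_demo, u_demo, x0)
u_new = taylor_estimate(a, b, x0, 0.5)  # estimate at an input outside the demonstrations
print(u_new)
```

Because the assumed data are exactly linear, the estimate at the new input matches the underlying line; with real demonstrations the estimate is only a local approximation around `x0`.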
Similar resources
Learning Constrained Generalizable Policies by Demonstration
Many practical tasks in robotic systems, such as cleaning windows, writing or grasping, are inherently constrained. Learning policies subject to constraints is a challenging problem. We propose a locally weighted constrained projection learning method (LWCPL) that first estimates the constraint and then exploits this estimate across multiple observations of the constrained motion to learn an un...
Manipulation Control of a Flexible Space Free Flying Robot Using Fuzzy Tuning Approach
Cooperative object manipulation control of rigid-flexible multi-body systems in space is studied in this paper. During such tasks, flexible members such as solar panels may vibrate, which in turn may exert oscillatory disturbing forces on other subsystems and consequently produce errors in the motion of the end-effectors of the cooperative manipulating arms. Therefore, to design and dev...
Learning Dexterous Manipulation for a Soft Robotic Hand from Human Demonstration
Dexterous multi-fingered hands can accomplish fine manipulation behaviors that are infeasible with simple robotic grippers. However, sophisticated multi-fingered hands are often expensive and fragile. Low-cost soft hands offer an appealing alternative to more conventional devices, but present considerable challenges in sensing and actuation, making them difficult to apply to more complex manipu...
Trajectory Optimization of Cable Parallel Manipulators in Point-to-Point Motion
Planning a robot trajectory is a complex task that plays a significant role in the design and application of robots in task space. The problem is formulated as a trajectory optimization problem, which is fundamentally a constrained nonlinear optimization problem. An open-loop optimal control method is proposed as an approach for trajectory optimization of a cable parallel manipulator for a given two-end-po...
Learning Deep Policies for Physics-Based Manipulation in Clutter
Uncertainty in modeling real world physics makes transferring traditional open-loop motion planning techniques from simulation to the real world particularly challenging. Available closed-loop policy learning approaches, for physics-based manipulation tasks, typically either focus on single object manipulation, or rely on imitation learning, which inherently constrains task g...
Journal title:
- Journal of Intelligent and Robotic Systems
Volume 80
Publication date: 2015